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Abstract 

Wc provide rates of convergence to the asymptotic distribution of 
the (properly scaled) degree of a fixed vertex in two preferential at- 
tachment random graph models. Our approach is to show that these 
distributions are unique fixed points of a certain distributional trans- 
formation that allows us to obtain rates of convergence using a new 
variation of Stein's method. Despite the large literature on these mod- 
els, there is surprisingly little known about the limiting distributions 
so we also provide some properties and new representations includ- 
ing an explicit expression for the densities in terms of the confluent 
hypergeometric function of the second kind. 

1 INTRODUCTION 

Preferential attachment random graphs are random graphs that evolve by se- 
quentially adding vertices and edges in a random way so that connections to 
vertices wi th high degree are favored. Following the publication of Barabasi 
and Albert 11999^ there has been an explosion of research surrounding these 



types of models; the seminal reference in the mathematics literature is Bol- 
lobas, Riordan , Spen cer, and Tusnady (|200ll ). One of the main results of 



Bollobas et al.1 (120011 ) is a rigorous proof that the degree of a randomly cho- 
sen vertex in a particular family of preferential attachment random graph 
models converges to the Yule-Simon distribution. Corresponding approxi- 
mation results in total variation for this and rela t ed pr efere ntial a t tachm ent 
models can be found in |Pek5z. Rollin. and Rossi (j2O10h and|Ford| (j2009h . 



Here we study the distribution of the degree of a fixed vertex in two 
preferential attachment models. In Model 1 we start with a graph G2 with 
two vertices labeled one and two with an edge directed from vertex two to 
vertex one. Given graph G n , graph G n+ \ is obtained by adding a vertex 
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labeled n + 1 and adding a single directed edge from this new vertex to a 
vertex labeled from the set {1, . . . , n}, where the chance that n + 1 connects 
to vertex i is proportional to t he degree of vertex i in G n . Model 2 is one 
studied in Bollobas et al. ( 200ll ) and allows for self-connecting edges. There, 
we start with a graph G\ with a single vertex labeled one and with an edge 
directed from vertex one to itself. Given graph G n , graph G n +i is obtained 
by adding a vertex labeled n + 1 and adding a single directed edge from 
this new vertex to a vertex labeled from the set {1, . . . , n + 1}, where the 
chance that n + 1 connects to vertex i G {1, ... ,n} is proportional to the 
degree of vertex i in G n and the chance that vertex n + 1 connects to itself 
is l/(2ra + l). 

Let W n> i be the degree of vertex i in G n under either of the models 
above. Our main result is a rate of convergence in the Kolmogorov metric 
(defined below) of ^^/(EW^) 1 / 2 to its distributional limit as n — > oo. 
Though the literature on these models is large, there is surprisingly little 
known about these distributions. The fact that these limits exist for the 



first model has been shown in iMoril (|2005l ) a nd iBackhausa (|201ll ) and the 
same result for both models can be read from Janson (|200fih by relation to 
a generalized Polya urn, although the existing descriptions of the limits are 
not very explicit. The primary tool we use here to characterize the limits and 
obtain rates of convergence is a new distributional transformation for which 
the limit distributions are the unique fixed points. This transformation 
allows us to develop a new variation of Stein's method; we refer to Ross 
and Pekoz (l2007h and lChen. Goldstein, and Shaol (|201ll ) for introductions 
to Stein's method. 

To formulate our main result, we first define the family of densities 



k s (x) 




exp( 



i)C7( s -l,|,|i), for x> 0,0 1/2, (1.1) 



where T(s) denotes the gamma function and U(a, b, z) denotes the confluent 
hyper geomet ric function of the second kind (also known as the Kummer U 
function); see lAbramowitz and Stegun (1 19641 . Chapter 13). We denote by K s 
the distribution function defined by the density k s . Define the Kolmogorov 
distance between two cumulative distribution functions P and Q as 



d K (P,Q) = sup|P(x) - Q(x)\ 



Theorem 1.1. Let W n i be the degree of vertex i in a preferential attachment 



graph on n vertices defined above and let b. 
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n,i 



For Model 1 with 



2 



2 < i ^ n we have 



d K (^(W n ,i/6 nj i),^-i) < 4= 



77 



and /or Model 2 with 1 ^ i ^ n we have 
/or some constant C independent of n. 

Remark 1.1. Using Proposition 12.51 below, we see an interesting difference 
in the behavior of the two models. In Model 1 the limit distribution for 
the degree of the first vertex (which by symmetry is the same as that for 
the second vertex) is K\, the absolute value of a standard normal random 
variable, whereas in Model 2 the limit distribution for the first vertex is 
K1/2, the square root of an exponential random variable. 

Remark 1.2. To ease exposition we present our results as rates, but the 
constants are recoverable (although probably not practical, especially for 
large i). 

Theorem 11.11 will follow from a more general result derived by developing 
Stein's method for the distribution K s . The key ingredient to our framework 
follows from observing that K s is a fixed point of a certain distributional 
transformation which we will refer to as the "s-transformed double size bias" 
(s-TDSB) transformation, which we now describe. 

Recall for a non- negative random variable W, we say W' has the size 
bias distribution of W if 

E{Wf(W)} = 1EW1Ef{W'), 



for all / such that ElV7f(W)l < oo; see iBrownl (120061) and Arratia and 



Goldstein (J2010|) for surveys and applications of size biasing. We will write 



W" to denote a random variable having the size bias distribution of W; 
alternatively, we say W" has the double size bias distribution of W and it is 
straightforward to check that 

1E{W 2 f(W)} = Elf 2 ®f(W"); (1.2) 

see also Goldstein ( 20071 ). Now, we have the following key definition. 
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Definition 1.3. For fixed s ^ 1/2, let U\ and U2 be two independent 
random variables, uniformly distributed on the interval [0, 1], and let Fbea 
Bernoulli random variable with parameter (2s) -1 , independent of U\ and Ui- 
Define the random variable 

V := Ymax(U 1 ,U 2 ) + (1 - Y) min(E/i, U 2 ). 

We say that W* has the s-transformed double size biased (s-TDSB) distri- 
bution of W, if 

^{W*) =&(VW"), 
where W", the double size bias of W, is assumed to be independent of V. 

Our next result essentially states that the distribution K s is the unique 
fixed point of the s-TDSB transformation. That is, Sf(W) = ££{W*) if and 
only if W ~ K s . Furthermore, the closer a distribution is to its s-TDSB 
transform, the closer it is to the K s distribution. Besides the Kolmogorov 
metric, we also consider the Wasserstein metric between two probability 
distribution functions P and Q, defined as 

d w (P,Q) = J \P(x) - Q(x)\dx. 

Theorem 1.2. Let W be a non-negative random variable with MW 2 = 1 
and let s ^ 1 or s = 1/2. Let W* have the s-TDSB distribution of W and 
be defined on the same probability space as W . Then, if s ^ 1, 

d w (J?(W),K s )^8s(s + ^ + ^E\W-W*\, (1.3) 

and, for any (3^0, 

d K {^(W),K s ) < 53s/3 + 34s 3/2 P[|W- W*\ > /?]. (1.4) 
If s = 1/2, then 

dw(Jf(W),K 1/2 ) < 2E\W-W*\. 

and 

d K (&(W),K 1/2 ) < 26(3 + 8P[\W -W*\ > /?]. 

Remark 1.4. In the case that s = 1, V is uniform on (0, 1) so that we 
obtain the interesting fact that J£{W) = Jtf(UW") for U uniform (0, 1) and 
independent of W" if and only if W is the absolute value of the standard 
normal random variable; see Proposition 12.51 below. 
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Although there are abstract formu lations for dev eloping Stein's method 
machinery for a given distribution, see iReinertl (|20Q5h . our framework below 
does not adhere to any of these directly since the characterizing operator we 
use is a second order differential operator (see (|3.ip and (|3.2p below). For 
the distribution K x ; the usual fir st order St ein operator derived from the 
density approach of lReinert (|2005h (following [stein] (jl986h l is a complicated 
expression involving special functions. However, by composing this more 
canonical operator with an appropriate first order operator, we are able to 
derive a second order Stein operator (see ()3.4f) below) which has a form 
that is amenable to our analysis. This strategy may be useful for other 
distributions which have first order operators that are difficult to handle. 

The usual approach to developing Stein's method is to decide on the 
distribution of interest, find a corresponding Stein operator, and then derive 
couplings from it. The operator we use here was suggested by the s-TDSB 
transform which in turn arose from the discovery of a close coupling in 
the preferential attachment application. We believe this approach of using 
couplings to suggest a Stein operator is a potentially fruitful new strategy 
for extending Stein's method to new distributions and applications. 

There have been several previous developments of Stein's method us- 
i ng fix ed points of distributional transformations. I Goldstein and Reiner t 



(|l997l ) develops Stein's method using the zero-bias transformation for which 
the normal distribution is a fixed p oint. Letting U b e a uniform (0,1) ran- 
dom v ariable independent of all else, Goldstein ( 20091 ) and Pekoz and Rollin 
(|201ll ) develop Stein's method for the exponential distribution using the fact 
that W and UW' h ave the same distribu t ion if and only if W has an expo- 
nenti al distribution: Pakes and Khattreel (1992 - ) and Lyons. Pemantle, and 
Peres |l995|) also use this property. We will show below that W and UW" 
have the same distribution if and only if W is distributed as the absolute 
value of a standard normal random variable. In this light, this paper can be 
viewed as extending the use of these types of distributional transformations 
in Stein's method. 

The layout of the remainder of the article is as follows. In Section [2] 
we discuss various properties and alternative representations of K s , in Sec- 
tion [3] we develop Stein's method for K s and prove Theorem 11.21 and in 
Section [4] we prove Theorem 11.11 by constructing the coupling needed to 
apply Theorem 11.21 and bounding the appropriate terms. 
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2 THE DISTRIBUTION K< 



In this section we collect some facts about K s ; recall the notation and defi- 
nitions associated to the formula for the density n s (x). Let us collect some 
proper ties of the Kummer U function from Chapter 13 of Abramowitz and 
Stegun ( 1964 ). The notation U'(a.b.z) refers to the derivative with respect 
to z, and the left it alic labeling of the formu l as be low corresponds to the 
equation numbers of lAbramowitz and Stegunl ()1964l ). 

Lemma 2.1. Let a,b,z 6 R. 
(13.1.29) U(a,b,z) = z^ b U(l + a - b, 2 - b, z)\ 



(13.2.5) ifa,z>0,U(a,b,z) 



1 



e- zt t a -\l + t) 



6-a-l 



(2.1) 
dt- (2.2) 

r W Jo 

(13.4.21) U'(a,b,z) = -aU(a + 1,6 + l,z); (2.3) 
(13.4.24) (l + a-b)U(a,b-l,z) = (l-b)U(a,b,z)-zU'(a,b,z); (2.4) 



(13.4.25) U{a, b, z) - U'(a, b, z) = U(a, b + 1, z); 

(13.4.21) U(a-l,b-l,z) = (1 - 6 + z)U(a, b, z) - zU'(a, b, z); 

(13.5.2) forz>0, U(a,b,z)~ z - a (z -> 00); 

(13.5.10) fora>~l, U(a, \, 0) = T(i)/r(a + \) 

As a direct consequence of ()2.5[) we have 

§- z {e- z U(a, b, z)) = -e- z U(a, b + 1, z); 

combining (|2.3p and (|2.7p we find 

U(0,b,z) = l; 

and using (|2.ip and (|2.10p implies that for z > 0, 

U(-\,\,f)=z. 

By comparing integrands in (|2.2|) . we also find the following fact. 

Lemma 2.2. Let < a < a' , b < b' , and z > 0. Then 

T(a)U(a,b, z) > T(a )U(a ,6, z) and U(a,b,z) < U(a,b',z). 

The next results provide simpler representations for K s . 



(2.5) 
(2.6) 
(2.7) 
(2.8) 



(2.9) 



(2.10) 



(2.11) 
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Proposition 2.3. If X andY are two independent random variables having 
distributions 

'B(M-l), ifs>\, 
5(1/2, s - 1/2), tfl/2<s<l, 



where B(a,b) denotes the beta distribution, and 
Y 



fr(i/2,i), ifs>i, 

\Exp(l), i/l/2<a<l, 



where T(a,b) denotes the gamma distribution and Exp(A) the exponential 
distribution, then 

V2sXY ~ K s . 

Proof. Let s > 1 and observe that by first conditioning on X, we can express 
the density of y/2sXY as 

Ps(x) := ^^I 1 ) exp(-f )y~y\l - y)-*dy. (2.12) 
V S7r JO 

After making the change of variable y = 1/(1 +£) in (12.12p . we find 

P.(s) = ^^^exp(^±l))^ 2 (l + i)V^, 

and now using (|2.2p in the definition of k s implies that k s = p s . 

Similarly, if 1/2 < s ^ 1, then we can express the density of y/2sXY as 

Qs(x) := I}Z )X n / 1 exp(^) 2r 3 / 2 (l- 2/ )- 3 / 2 <i y , (2.13) 

S^TTl (S — 2) JO 

and after making the change of variable y = 1/(1 + 1) in (12.13|) . we find 
Sy/irT(s - ~) Jo v y 

= r( s )yA exp( ^)-^ C / (s - 1,3/2,1;), 

where we have used (12. 2p in the second equality. Applying (12, ip to this last 
expression implies k s = q s . □ 

The previous representations easily yield useful formulae for Mellin trans- 
forms. 
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Proposition 2.4. If Z ~ K s with s ^ 1/2, then for all r > —1, 



sy/2 r(a)r(r + l) 
2/ 



r § + *) 



(2.14) 



Proof. Using Proposition 12.31 and well known formulas for the Mellin trans- 
forms of the beta and gamma distributions, we find 



EZ r = (2s) 



r/2 



r( s )r(§ + i)r(§ + ±) 



r(5 



+ s)T(l) ■ 
An application of the gamma duplication formula yields 

r(§ + i)r(§ + ±)=r(i)2-'T(r + i), 

which combined with (|2.15p implies (12. 141) . 

In a few special cases we can simplify and extend Proposition 
K s (x) denotes the distribution function of of K s . 

Proposition 2.5. We have the following special cases of K s : 



(2.15) 



□ 

Below 



(i) k 1/2 (x) = 2xe 
(ii) Ki(x) = {2/Ti) l l 2 e- 
{Hi) lim K s {x) = 1 — < 



x 2 /2 



Proof. The Identities (z) and (ii) are immediate from ()2. and (|2,10p . 
respectively. Using Stirling's formula for the gamma function to take the 
limit as s — > oo for fixed r in (|2.14|) yields the moments of Exp(\/2) which 
proves (Hi). □ 

Remark 2.1. As discussed below, the preferential attachment model we 
st udy is a special case of a generalized Polya triangular urn scheme as studied 
by I Jansonl (|2006l ). The limiting distributions in their Theorem 1.3 part (v) 
with a = 2 and (5 = 7=1 include K s . In fact, Example 3.1 of I Janson 
(120061 ) discusses these limits, but with the exception of the case s = 1, it 
does not appear that the decomposition of Proposition 12.31 has previously 
been exposed. On the oth e r han d, up to a scaling factor, the moment formula 
of Theorem 1.7 of Janson ( 20061 ) simplifies to that of Proposition [2~4l for K s . 
In a related v ein, K s also has moments of gamma type; see the survey by 



Jansonl (120101 ) . in particular Section 9. 
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Additionally, if Z ~ K s , then Z 2 /(2s) ~ £?(!, 1/2; s) for s > 1/2 wher e 
D(a,b;c) is a Dufresne law as defined in Chamavou and Letac ( 19991 ). 
Dufresne laws are essentially a generalization of products of independent 
beta and gamma random variables and they also have moments of gamma 
type. 

We now collect one more fact about K s , which will also prove useful in 
developing the Stein's method framework below. 

Lemma 2.6 (Mills Ratio for K s ). For every x ^ and s ^ 1, 

i f°° , , , f R s 



2 

Proof. By making the change of variable |j = z and then applying (|2.1 
and ([2J5]), we find 

K s (y)dy = — /=" £2 z" 1/2 exp(-2;)?7(s - 1, 

poo 



/>oo 

L exp(-z)C/(s- \,\,z)dz 

2r 



so that 



^ex P (^)[/( S -I,I,|J), 
1 f°° Ks (y)dy = (2.16) 



«.(*)./, V 2 C/( S -l,±,g) 

First note that by applying (12. ip in the denominator of the final expression 
of (|2.16p we have 



where the inequality follows by Lemma 12.21 

Now applying (|2.ip in both the denominator and the numerator of (|2,16p , 
we find 

«.Wi, V 2(7( a - i,l,f|) V 2 r(«) 

where again the inequality follows by Lemma [2. 2i Now applying Lemma [2. 71 
below to this last expression and combining with (|2.17p yields the lemma. 

□ 
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Lemma 2.7. If s ^ 1, then 

JsC(s-\) 
1 ^ T(s) < ^ 
Proof. Theorem 1 of Bustoz and Ismaill ( 19861 ) implies that 

v^r( s -i) 



(2.18) 



T(s) ' 

is a decreasing function on (1/2, oo), so that for s ^ 1 (|2.18p is bounded 
above by i/tt. Moreover, Stirling's formula implies 

lim ^ r / S ' > =1. □ 
S^OO ^r(s - i) 

3 STEIN'S METHOD FOR K s 
In this section we develop Stein's method for K s and prove Theorem 11.21 

Lemma 3.1 (Characterizing Stein operator). If Z ~ K s for s ^ 1/2, then 
for every twice differentiable function f with /(0) = /'(0) = and such that 
E|/"(Z)|, E\Zf'(Z)\, andE\f(Z)\ are finite, we have 

E{sf"{Z) - Zf'(Z) - 2(s - l)f(Z)} = 0. (3.1) 

Proof. Let C s := y/2T(s)/y/slr. Using integration by parts and (|2.3[) . 

E{sf"{Z)} 

/•oo 

= C S / s f"( x )e W (^rP{s-l,H)dx 
Jo 

= C s /'(x)xexp(^) (U(s - 1, i §£) + (a - 1)C7( S , §, W 



/•oo 

= E{Z/'(Z)}+C S / f'( x ) . ( s - l) X exp(=g-)U{s,H)dx. 
Jo 

Using again integration by parts and then <\2M . 

roo 

Jo 

POO 

= C S / /(x)-2( S -l)exp(^) 
J o 

POO 

= C S / /(x)-2( S -l)exp(^)C/( S -l,i,g)dx 

JO 

= E{2( S -1)/(Z)}. 
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Hence 

M{sf"(Z)} = E{Z/'(Z)} + E{2( S - l)f(Z)} 
which proves the claim. □ 

For the sake of brevity, let V s (x) := f7(s — 1, |, |^) . 

Lemma 3.2. The second order differential equation 

sf(x) - xf'(x) - 2(a - l)/0) = h(x) - m(Z) (3.2) 

has solution 



1 P' 1 f y ~ 

f{x) = -V s {x) / -— — -— / h{z)K s {z)dzdy 
s Jo V s {y)K s {y) J 

1 f x 1 f°° ~ 

= — V s (x) / —— — — / h(z)K s (z)dzdy 
s Jo Vs{y)K s {y) J y 



(3.3) 



where h = h- E/i(Z). 

Proof. First we prove that we can write (|3.2p as 

sg'(x) - s(f - d(s))<7(x) = /'(*) - d(x)/(x) = g(x) (3.4) 

with 

d V'(x) 
d (x) = —\ogV s (x) = -fyj. (3.5) 
ox V s {x) 

In order to see this, first combine the two equations of (|3.4p to obtain 

sf"{x) - xf'(x) - (sd'(x) + sd{xf - xd(x))f{x) = h(x) 
Hence, we only need to show that 

sd'(x) + sd(x) 2 - xd(x) = 2(s - 1). (3.6) 
In order to simplify the calculations, let us introduce 

d , ix U'(s-l±,z) 

DW = _ logF( .-,, i ,,) = _i_^ i 

note that d(x) = f-D(fj)- With this and z = ()3.6|) becomes 

(± - *)D(z) + + zD(z) 2 = s - 1. (3.7) 
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The left hand side of (|3.T|) is equal to 

(±-z)U'(s-l,±,z)+zU"(s-l,±,z) 

A-k + zMs,lz)-zU'(s,lz) 
~ [ } U(s-l,i,z) ~ S 1? 



where we used f|2.3j) and then (|2.6p . Hence (|3.6|) holds and therefore (|3.4p is 
equivalent to ()3.2|) . 

Let us now proceed to solve (|3.4p . Note first that the general differential 
equation 

F'(x) - A'(x)F(x) = H(x), x^O, F(0) = 0, 

has solution 

F(x) = e A ^ ! H(z)e- A ^dx 



Hence, noticing that 

x d 

d(x) = — logn s (x) 

s ox 



the solution to the first equation in (|3.4[) is 

9{y) = — TT / K s {z)dz, 



o 



Ks(y) 

whereas the solution to the second equation in (|3.4p is 



V s (y) 

which is the first identity of (|3.3p ; the second follows by observing that 
Jq 00 h(x)n s (x) = 0. □ 

Before developing the Stein's method machinery further we need two 
more lemmas, the first of which is well known. 

Lemma 3.3 (Gaussian Mills Ratio). For x,s > 0, 

ex p(fi) f x e Minr) dt < min { YY' ^} • 

Lemma 3.4. If d(x) is defined by (|3.5p . then for s ^ 1 

, , V2T(s) r 
^ -d(x) < _V V ; < V2 

v^r(s- 5 ) 

< -xd(x) < 2(s - 1). 
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Proof. To prove the first assertion, note that f)2. 1|) and Lemma 12.21 imply 

< V2(s - i)r( s - 1) 
" v^r( s -i) 

The claimed bound now follows from Lemma 12.71 

For the second assertion, we use f|2.4j) in the second equality below to 
find 

o rrl I -, 1 - 2 S ~ / - 1 J, 

X 



» f/(»-l.i.f|) 2 ' £/(»-l, ifj) 

Applying Lemma 12.21 to (j3.8j) proves the remaining assertion. □ 
Lemma 3.5. Zei g satisfy the first equation of (|3,4p . 

• If h is non-negative and bounded, then for all x > anci s ^ 1, 

| 5 (x)|^||/ l ||min|iy|,i}. (3.9) 

• Ifhis absolutely continuous with bounded derivative, then for all s ^ 1 

\\9\\<W\\(i + ^- (3.10) 

Proof. It is easy to verify that we may write 

9{x) = i—r I K s {y)h{y)dy = — / K s (y)h(y)dy. 

SK S (X) J SK S (X) J x 

Thus if h > with \\h\\ < 00, then for all s ^ 1, 

\9(x)\ < k , [ K s (y)dy < mini 4/^, - 



where we have used Lemma Y2M, this shows (13. 9ft . 

If ||/i'|| < 00, then without loss of generality we can assume that h(0) = 
so that for x ^ 0, ^ ll^'ll^- m particular, h(x) ^ (x + ]&Z s )\\h'\\ so 

noting that EZ S < 

= 1, we can now use Lemma 12.61 to find 

1 < \\< ¥1 r f < w <r H^ll ( S?v«*(y)dy , M 
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To bound the integral in this last expression, we make the change of variable 

2 s 



y~ — z and apply (|2.9p to find 



r . r e -. g (._ 1| . i , )J ,_.g(;-i.-*.a < . i 



«■<*) *•(*)./£ v ^-i,if) 

where the last inequality follows from Lemma 12.21 □ 
Lemma 3.6. Let f be defined as in (|3.3p . 

• If h is non-negative and bounded and s ^ 1, t/ien 

H/'ll < V^\\h\\. (3.11) 

• If h is non-negative, bounded and absolutely continuous with bounded 
derivative and s ^ 1, then 

||/"|| <2^7rx/i+^ \\h\\. (3.12) 

J/s = 1/2, then 

\\f"\\^4\\h\\. (3.13) 

• If h is absolutely continuous with bounded derivative and s ^ 1, then 

\\r\\<»{s + l + ^)\\h% (3-14) 

If s = 1/2, i/ien 

H/'l < 4||/i'||. (3.15) 

Proof. If either h is bounded or absolutely continuous with bounded deriva- 
tive, then Lemma 13.51 implies that / satisfies the second equation of (|3.4p 
for g bounded. Thus 

If s ^ 1, then since V s (x) = U(s — 1, ^, is non-increasing for positive x, 
we find 

x\\g\\. (3.16) 

Now, again by (|3.4p . we have 



|/'(x)| < |d(x)/(x)| + HffH < HffH (|xd(x)| + 1) < ||ff||(2 S - 1), (3.17) 
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where we have used (|3.16p in the first inequality and Lemma 13.41 in the 
second. Applying the bound (|3.9p proves (|3.1ip . 

To bound /" for h having \\h'\\ < oo, let s ^ 1/2 and differentiate (|3.2p 
to find 

f"'{x) -§/"(*) = — /'(x) + ^ 
s s 

which implies 

i («P(=#) A.)) = exp(#) (^/'(x) + . 
Integrating, we obtain 

ex P (^)/"(x) = - /°°exp(^) (^/'(y) + ^)dy, 
so that Lemma 13.31 yields 

\f"(x)\ < (2, - 1)||/'|| min 1} + I exp(fj) jf" exp(^) /»'(y)dy. 

(3.18) 

If \\h\\ < oo, then an integration by parts yields a bound on the second term 
of (|5ilg|l which yields 

l/"WK(2 S -DII/'l|min{y|,I} + iH. 

If s ^ 1, then apply the bound (|3.1ip above on ||/'|| to find f|3. 12[) : for 
s = 1/2, (|3.13p follows immediately. Alternatively, we can apply Lemma [3. 31 
directly to (f3~T8|) to find 




(3.19) 



Finally, differentiating (|3.2p yields 

s\f"(x)\ ^ \xf"(x)\ + (2s - 1)||/'|| + \\h% (3.20) 

the first term can be bounded by (13.19p . and if s ^ 1 a subsequent applica- 
tion of (USD on || /'|| and then (IBTlOjl on ||#|| yields fOg]) . If s = 1/2, then 
(pjlgj) follows from (|330j) and ([8A9]l . □ 
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In order to obtain the bounds for the Kolmogorov metric, we need to 
introduce the smoothed half-line indicator function 

1 r e 

K,e(x) 



i r 

- / l[x < a + s]ds. (3.21) 
e Jo 



Lemma 3.7. If Z ~ K s and W is a non-negative random variable and 
s 1, then, for all e > 0, 

d K (^(iy),K s ) ^ suplEV^W) - E/»a, 6 (Z)| + e^2. 



If s = 1/2, i/ien, /or a// e > 0, 

d K (^W,iri /2 ) < snp\m a , £ (W) -Eh a>£ (Z)\ +ey^. 

Proof. The lemma follows from a well known argument and the following 
bounds on the density k s (x). If s ^ 1, then k s (x) is decreasing in x and 
from (BTBb. 



«,(o = . 1 ^ ^ A 

where the inequality is by Lemma 12,71 If s = 1/2, then k s (x) = 2xe~ x2 
which has maximum y/2/e. □ 

We will also need the following "indirect" concentration inequality; it 
follows from the arguments of the proof of Lemma 13.71 immediately above. 

Lemma 3.8. If Z ~ K s and W is a non-negative random variable and 
s 1, then, for all a < b, 

F(a< W^b) < V2(b-a) + 2d K (^(W),K s ). 

If s = 1/2, i/ien, /or all ^ a < 6, 

P(a < IF < 6) < 727e(6 - a) + 2 d K (J2f (W), K B ) . 

Lemma 3.9. If f satisfies (|3.2p /or /i a|£ and s ^ 1, i/ien 

s|/"(x + 1) - /"(x)| < |t| (2x{tt^ + 1) + (2s - 1)V2tt ' 

j /-tVO 

H — / I [a < x + u ^ a + e]du. 

e JtAQ 

If s = 1/2, i/ien 

3l/"(s + t) - /"(a?) | < 4|t|x + -/ I[a < x + u sC a + e]du. 
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Proof. Using (|3.2p . we obtain 

s(f"(x + t) - f"(x)) = x(f'(x + i) - f(x)) + tf\x + t) 

+ 2(s - l)(f(x + t) - f{x)) + h a>£ (x + i) - h a , £ {x), 

hence 

s\f"(x + 1) - f"{x)\ ^ \t\ (x\\f\\ + ll/'H + 2(s - 1)||/'||) 

j /-tVO 

H — / I [a < x + u ^ a + ejdit. 



Applying the bounds of Lemma 13,61 yields the claim. □ 

Lemma 3.10. Let W be a non-negative random variable with MW 2 = 1 
and let W* be the s-TDSB of W as in Definition \ 1.31 for some s ^ 1/2. For 
every twice differ entiable function f with /(0) = /'(0) = and such that the 
expectations below are well defined, we have 

slSf"(W*) = E{Wf'(W) + 2(s - l)f{W)} 

Proof. The lemma will follow from two facts: 

• If W" has the double size bias distribution of W, then for all g with 
E|W 2 g(J40| < oo, 

mg(W") = 1E{W 2 g(W)}. 

• If g is a function such that #'(0) = g(0) = with E|g"(V)| < oo, then 

slEg"(V)=g'(l) + 2(s-l)g(l). 

The first item above is easy to verify from the definition of the size bias 
distribution and the fact that EVF 2 = 1, and the second follows from a 
simple calculation after noting that V has density (2 — -) — 2x(l — i) for 
< x < 1. 

By conditioning on W" and using the second fact above for g(t) = 
f(tW")/{W") 2 , we find 

sEm ,. E {£gP +2(s _ 1) /^}, 

and applying the first fact above proves the lemma. □ 
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Proof of Wasserstein bound of Theorem M.SX Making use of Lemma 13 .21 and 
Lemma 13.101 we have 

Eh(W) - m(Z) = E{a/"(W) - Wf{W) - 2(s - l)f{W)} 
= sTE{f"(W)-f"(W*)}, 

where / is given by (|3.3|) . If h is Lipschitz continuous, then / is three times 
differentiable almost everywhere and we have 

\Eh(w) - m(z)\ ^ s\\f"\mw - W*\. 

We now obtain f)l .3|) by invoking (|3.14p and f)3. 15|) of Lemma 13.61 □ 

Proof of Kolmogorov bound of Theorem M.SX Fix a > and let s > 0, to be 
chosen later. Let / as in (|3.3p with respect to h a ^ £ from (|3.2ip . Define the 
indicator random variable J = I[| W — W*\ ^ (3]. Now, 

m atE (w) - m a , e (z) 

= sM{f(W)-f"(W*)} 

= sE{J(f"(W) - f"(W*))} + sE{(l - J)(f"(W) - f"(W*))} 

= : R± + i?2- 
If s ^ 1, using (|3.12p from Lemma 13.61 implies 

life | ^4(vrs 3/2 + l)F(|VF-T^*| > /3) < 17s 3/2 P(\W - W*\ > 0). 
Applying Lemma 13.91 

1 r p 

\Ri\ ^ /3(2EW(7iVs + l) + (2s-l)\/27r) + - / P(a < W + u sC a + e)du. 

Noticing that ET^ ^ 1 and applying Lemma 13^81 to the integrand, 

| ^ 12s/3 + 2/3e" 1 (v / 2e + 25) < 15s/3 + 4/fe~ 1 <5 

where <5 = d K {^f{W),K s ). 
From Lemma \'6.7\ we have 

£< v / 2e + 15s/3 + 4/?e- 1 5 + 17s 3/2 F(|iy-VF*| > /3). 

Choosing e = 8/3 and solving for <5, 

5 < 16V2s/3 + 30s/3 + 34s 3/2 F(|W-M/*| > $). 

which yields p. 41) . 

A nearly identical argument yields the statement for s = 1/2. □ 
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4 PROOF OF THEOREM O 

We first reformulate Theorem II, II in terms of a generalized Polya urn model. 
An urn initially contains i black balls and j white balls and at each step a 
ball is drawn. If the ball drawn is black, it is returned to the urn along with 
an additional a black balls and (3 white balls; if the ball drawn is white, 
the ball is returned to the urn along with an additional 7 black balls and 
5 white balls. We use the notation (a, /3; 7, 5)" . to denote the distribution 
of the number of white balls in this model after n draws and replacements. 
For example, (a, j3; 7, 5)®j has a single point mass at j. 

Theorem II. lb . Let n ^ 1 and i ^ be integers and Jsf(W n i) = 
(2,0; 1, 1)2 1- Ifb'^ li = EW^j, then, for some constant C independent ofn, 

dK{^(W n ,i/b nji ),K [i+1)/2 ) < 4=- 



Theorem 11.11 follows immediately from Theorem II. lb after noting that for 
Model 1 with i ^ 2, the degree of the ith. vertex when the graph has n 
vertices has distribution (2, 0; 1, l) 2i ~g x , and for Model 2 with i ^ 1, the 
degree of the ith vertex when the graph has n vertices has distribution 
(2,0; M)^ 1 . 

Let W := W n j have distribution (2,0; 1, l)f x . We will use (\1A\i to prove 
Theorem II. lb and so we will show that there is a close coupling of W and 
VW", where V and Y are is as in Definition 11.31 with s = (i + l)/2. This 
result will follow from the following lemmas proved at the end of this section. 

Lemma 4.1. There is a coupling (Z,W") of (2, 0; 1, 1)™3 1 and the double 
size bias distribution of (2,0; 1, 1)^ satisfying P(Z / W h ) ^ C/^/n. 

Lemma 4.2. 

• The distribution of (2,0; 1,1)^ given the first ball drawn is white is 
equal to (2, 0; 1, 1)"~^ 2 . 

• The distribution of (2,0; 1,1)^ given the first ball drawn is black is 
equal to (2,0; 1, l)"+ 2 ^. 

Lemma 4.3. For Z as in Lemma\4-1\ 



(2,0;1,1)™ + ^ 2 = (1,0;0,1)^ 3 and (2, 0; 1, 1)^ = (1, 0; 0, l)^ 3 , 



where the notation (1, 0; 0, l)f 2 3 is understood by the relation [(1,0; 0, l)f 2 3 \Z 
z] = (l,0;0,l)f- 2 3 . 
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Lemma 4.4. Let U\ and U 2 uniform (0, 1) random variables, independent 
of each other and of Z , defined as in Lemma \4-l\ Then there exists ran- 
dom variables X\ with distribution (1,0; 0, l)f^ an d X 2 with distribution 
(1,0; 0, l)^ 3 such that 

\Xi - Zmax(U 1 ,U 2 )\ < 3 and \X 2 - Zmm(Ux, U 2 )\ < 3. 

Prom these lemmas, we can now easily prove Theorem II. lb : here and 
below we use C to denote a generic constant that may differ from line to 
line. 

Proof of Theorem 1.1b. Let (Z,W") be defined as in 14.11 above and let Y 
be a Bernoulli(l/(l + i)) random variable; the parameter is equal to the 
probability that the first ball drawn under the (2, 0; 1, l)i t i urn rule is white. 
Lemmas 14. 2| I4.3| and 14.41 imply that we can couple W and VZ together so 
that \W — VZ\ < 3 almost surely. Thus, using Lemma 14.11 

P(|W - VW"\ > 3) ^ P(W" ^ Z) < C/^n~, 

and the theorem follows from (jl.4|) taking f3 = 3/b, noting that for c > 0, 
(cW)' = cW, and using b 1 ^ Cn from Lemma 14.71 □ 

We have left to prove Lemmas l4. 1114.21 14.31 and !4.4l The proof of Lemma 
14.11 is the most involved, so we postpone it until the end of this section. 
Lemma 14.21 is obvious and the proofs for Lemmas 14.31 and 14.41 immediately 
follow. 

Proof of Lemma \4-3\ Consider an urn with i green balls, 1 black ball and 2 
white balls. A ball is drawn at random and replaced in the urn along with 
another ball of the same color plus an additional green ball. 

If X is the number of times a non-green ball is drawn in n — 1 draws, 
the number of white balls after n — 1 draws is distributed as (1,0; 0, 1)^ 2 • 
Since X + 3 is distributed as (2,0; 1, l)™^ 1 and the number of white balls 
after n — 1 draws has distribution (2,0; 1, l)27i2' * ne ^ rs * e< l ua tio n follows. 
The second equation follows from similar considerations. □ 

Proof of Lemma \4-4\ We will show that for U\ and U 2 independent uniform 
(0, 1) random variables, there exists random variables N and M such that 
Sf(N) = (1,0;0,1)™2 3 , Sf(M) =Sf(n-N), and 

\N - nmax(Ui,U 2 )\ < 3 and \M - rtmin(Lq, U 2 )\ < 3 a.s. 



20 



The formulas of burred (|201cL p. 206) imply that (1,0; 0, l)^ 3 has 
distribution function 

w-(^t)(^)- («) 

and it is straightforward to verify that 

iV := max([(n - l)C/i], 1 + \(n - 2)U 2 ]), 

has the same distribution. We find \N — n max([/i, < 3 and thus a 
coupling satisfying the first claim above. Defining 

M := mm(\(n - 1)U{\ , 1 + \(n - 2)U 2 ]), 

IfEIjl implies JS?(M) = if (n - iV), and |M - nmin(E7i, C/ 2 )| < 3. □ 

We now only need to prove Lemma 14.11 which is the content of the re- 
mainder of this section. The next lemma helps provide a useful construction 
for the double size bias distribution of a sum of indicators. 

Lemma 4.5. Let W = Y27=i-^i> w here the X{ are Bernoulli random vari- 
ables and b 2 := Elf 2 . For each j,k G {1, . . . , n}, let (X^ 1 ^^,. fc have the 
distribution of (XjW,- & conditional on Xj = X k = 1 and let J and K be 
random variables independent of the variables above satisfying 

P{J = j,K = k) = ^^, j,k>l. 

Note that J and K can be equal and in this case the notation i / j,k reduces 
to i 7^ j. In this notation, 

W" = X i J ' K) + 2-I[J = K] 

has the double size bias distribution ofW. 
Proof. Note that 

E/(VF") = b~ 2 ®(XjX k )-E{f (W) \Xj = X k = 1} 

j,k 

= b~ 2 nXjX k f (W)} = b~ 2 TE{W 2 f (W)}; 
this is exactly flL2]). □ 
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To simplify the notation we consider i fixed in what follows. We write 



j=0 

where for j ^ 1, Xj is the indicator that a white ball is drawn on draw j 
from the (2,0; 1, l)n urn and Xq = 1 to represent the initial white ball in 
the urn. We will then define random variables M^ k such that 

JSf (Af**) = Sf(W\Xj =X k = l), (4.2) 



k 



so that by Lemma [4. 5| if J and K are random variables independent of Mh 
satisfying 

I>(K = k,J = j) = m( - X £ Xj \ j,k>0, (4.3) 

for b 2 := E<W 2 , then Mn' K has the double size bias distribution of W. 

In order to generate a variable satisfying (|4.2p for j < k, we use the 
following lemma that yields a method to construct an urn process having the 
law of the (2, 0; 1, 1)^1 urn process up to time n conditional on Xf. = Xj = 1. 
This conditioned process follows the law of the (2, 0, 1, 1, )j 3 urn process up 
to (and including) draw j — 1. At draw j, exactly one black ball is added 
and then draws j + 1 through k — 1 follow the (2,0; 1, 1) urn law. Again 
at draw k exactly one black ball is added and then the process continues 
to draw n following the (2,0; 1, 1) urn rule. We write M^ k to denote the 
number of white balls after n draws for this process, and we refer to this as 
the M J,fc process. The next lemma shows that this construction of M^ k has 
the distribution specified in (j4.2|) . 

Lemma 4.6. Retaining the notation and definitions above and letting A m = 
{X m = 1}, ifl^j<k^n then 

P(A J \A kl W j - 1 ) = ^^, 
and ifl^l<j<k^n 

2 + Wi 1 

Proof. By B ayes' rule we have 

PML4 A W UA] 

nMA^A^w^) p^.,^) ( 4 - 4 ) 
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and we next will calculate the probabilities above. For j ^ 1, we have 
which implies that for k ^ j, 



i + 2k-l 
and 

P(AM^ J _0 = 52^t 1 ). (4 . 7) 

Now, to compute the conditional expectations appearing above, note first 
that 

EWWW = W M + = (-^) HV-,, (4.8) 

and conditioning on Wj-i and taking expectations yields 

E(W fc |T^-i) = ( -^fc 2 -l ) W-il^-i) 
and then iterating and substituting k — 1 for /c yields 

^-.w-.)=n( 4^-7)-i )^- (49) 

and also 

E(W fc _ 1 |A i ,W i _ 1 )= ff ( i + 2(fc-7)-l ) (1 + ^- l) - (410) 
We use a similar approach to obtain 

E(Wfc |Wfc_i) = W fe 2 _x f 1 - -^rl + (Wfc-! + I) 2 



i + 2k-\) v ' z + 2k - 1 



i + 2k - 1 7 K_1 i + 2k-l 
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which can then be added to (|4.8p while letting Dk = Wjt(l + Wk) to obtain 

MD k \W k -i) = * + 2 f + ? D fc _ 1 
v 1 7 i+2fc- 1 

and thus 

_|_ OZ- _|_ i 

E(2? k ^ H ) = .^I^ Pfc-ll^j-l). 
Iterating and substituting k — 1 for A; gives 



and also 



j _l_ 2fc — 1 

Ep^A,-, = (^-i + l)(W,-_i + 2). 

2 ~r ~T~ -L 



Letting 

= 1 "t^t 1 / i + 2(k-t) 

° (i + 2j-l)(t + 2fe-l) Al ^ i + 2 (fc-t)-l 

and applying (|4.7p and (|4.10p we have 

PC^lWi-i) = E(P(A J -|^_ 1 )P(A fe |A j ,W 3 -_ 1 )|W z _ 1 ) 
= cE(D i _i|W,_i) 
_ Z + 2J-1 , 
°i + 2Z — 1 

which also implies 



^(l + Wi-i] 



P(A,-A fc | A, = c **^ * (1 + Wi_i)(2 + Wi-x). 

Substituting these expressions appropriately into (|4.4p and (|4.5|) proves the 
lemma. □ 

We are now ready to prove Lemma l4.1i 

Proof of Lemma \4-1\ The (2, 0; 1, 1)^3 process and the M^ k process defined 
above differ only in that in the latter process, after either draw j or k a 
single black ball is added into the urn regardless of what is drawn; in the 
former process, two balls are always added to the urn depending on the 
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color drawn. This difference turns out to be small enough to allow a close 
coupling as stated in the lemma. 

For each t we will construct (zt, ml''*) to respectively denote the indicator 
for the event that a white ball is added to the urn after draw number t for 
the (2,0; 1, 1)^3 process and for the M J,fc process, and we write 



z n = y~]zt, 



±v± n 



t=0 



t=0 



to denote the number of white balls in the urn after draw n for each process. 
Let Ut be independent uniform (0, 1) random variables. We define 



zt = l 

and for t 7^ k,t 7^ j we define 



U t < 



Z t -i 



i + 2t + 1 



u t < 



i + 2t + 1 - I{t > j} - I{t > k} 



We also set w?^ 1 = m^' k 



since a single black ball is added after draws 

j and k. Using that the event M^ k 7= Z n -\ can be written as a union of 
the events that index t is the least index such that z t / m\' k , we have for 
< j < k, 



P(M^V^-i) 

n 

< E(z fc + Zj + m„) + P 



3 



< 9EX 3 - + 3^EWt_i 



i + 2t + 1 
1 1 



< U t < 



H-X 



i + 2t - 1 



i + 2t - 1 i + 2i + 1 



i—Vof-^M M 

+ ^.\li\i + 2t-l i + 2t + l) 



where we have used Lemma 14.71 below and Mzj ^ 3EX,- . Defining J, K as 
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in (|4.3p we then have 

F(M n J ^/Z n _ 1 ) 

2C i 

< P( J = 0) + P(if = 0) + P( J = K) + -p- ^2 r*®XjXk 

3 

Lemma 4.7. Fix i ^ 1 and let W n = Y^j=oXj w ^ ere f or j ^ 1) Xj is 
the indicator that a white ball is drawn on draw j from the (2,0; 1, l)n urn 
and Xq = 1. Retaining the other notation and definitions above, for all 
1 j < k ^ n, 



i — in 1 9 „ i + 2n + 1 

EWn < >/2^F\/t^2 + 2' n ^ (2 " ^ i+I : 

2tt(1 + V^) 



(i + 2) v / (i + 2j)(i + 2A;- 1) ' 



./7T 



V(i + l)(i + 2j-l) 
Proof. Prom (|4.9h we have 



/ 

from (14.61) we find 



EW n = TT 1 = . l> — . 2 .,?( , 4.12 

ll, + 2t-i r(^ + i)r(n + ^i)' 



EX, - 3 



3 i + 2j-V 
and using ([4T7]) and (|4.10p yields 

w)^ss»n i+2> 



% + 2k-l 11 i + 2t— 1 



Now using (|4.1ip we find 



9 i + 2n + 1 

EW 2 = 2— - EW n . 

n z + 1 
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Lemma 12.71 applied to (|4,12p implies 



1 



Phi 



n 



i + It 




2n 



+ 1 




t=l 



i + 2t-l 



i + 2 



and collecting the facts above yields the lemma. 



□ 
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